Stepwise classification of cancer samples using clinical and molecular data Additional file

نویسندگان

  • Askar Obulkasim
  • Gerrit Meijer
  • Mark van de Wiel
چکیده

where ρ is the correlation between x1 and x2. For each random number we sample ρ from the uniform distribution (0, 0.5). We set this correlation range because we want to illustrate the performance of the indirect mapping in a setting of low correlation between the two data types. In order to represent noisy data types we added independent standard Gaussian noise to both x1 and x2. We treat x1 and x2 as two different data types and generated the associated binary responses by following way:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

stepwiseCM: An R Package for Stepwise Classification of Cancer Samples Using Multiple Heterogeneous Data Sets

This paper presents the R/Bioconductor package stepwiseCM, which classifies cancer samples using two heterogeneous data sets in an efficient way. The algorithm is able to capture the distinct classification power of two given data types without actually combining them. This package suits for classification problems where two different types of data sets on the same samples are available. One of...

متن کامل

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

متن کامل

Consensus Molecular Subtypes of Colorectal Cancer and their Clinical Implications

The colorectal cancer (CRC) subtyping consortium has unified six independent molecular classification systems, based on gene expression data, into a single consensus system with four distinct groups, known as the consensus molecular subtypes (CMS); clinical implications are discussed in this review based on articles relevant to the CMS of CRC indexed in PubMed as well as the authors’ own ...

متن کامل

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

 In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011